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Abstract: Many proposed quantum mechanical models of black holes include highly nonlocal in- 
teractions. The time required for thermalization to occur in such models should reflect the relaxation 
times associated with classical black holes in general relativity. Moreover, the time required for a par- 
ticularly strong form of thermalization to occur, sometimes known as scrambling, determines the time 
scale on which black holes should start to release information. It has been conjectured that black holes 
scramble in a time logarithmic in their entropy, and that no system in nature can scramble faster. In 
this article, we address the conjecture from two directions. First, we exhibit two examples of systems 
that do indeed scramble in logarithmic time: Brownian quantum circuits and the antiferromagnetic 
Ising model on a sparse random graph. Unfortunately, both fail to be truly ideal fast scramblers for 
reasons we discuss. Second, we use Lieb-Robinson techniques to prove a logarithmic lower bound on 
the scrambling time of systems with finite norm terms in their Hamiltonian. The bound holds in spite 
of any nonlocal structure in the Hamiltonian, which might permit every degree of freedom to interact 
directly with every other one. 

Keywords: scrambling, signalling, black holes, thermalization, entanglement, Lieb-Robinson bounds, 
mean-field approximation 
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1 Introduction 

There is a growing consensus based on evidence from string theory and gauge-gravity correspon- 
dences that black holes do not destroy information when they evaporate. Roughly, the argument is 
that black holes can be realized in string theory in a manner that accounts for their entropy [1-7], and 
that certain string theories are equivalent to manifestly unitary systems [8-10]. For a recent review, 
see [11]. 

Instead of being lost, information about the microscopic state of the black hole leaks out with 
the hole's Hawking radiation, much as it would for any other radiating object. Early estimates for 
the amount of time it would take to recover a bit from a black hole, however, suggested that no 
information would leak out for an amount of time proportional to the black hole lifetime [12-14]. 
Since astrophysical black holes have lifetimes many orders of magnitude longer than the age of the 
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universe, that is tantamount to the information being lost forever. More specifically, such a long 
delay before the escape of information provided a plausible resolution to some of the conceptual 
conundrums of quantum gravity, most notably the apparent inconsistency of information release with 
the quantum no-cloning principle [14]. 

More recent estimates using techniques from quantum information theory, on the other hand, 
suggest that information could be released from black holes much more quickly [15]. Those calcu- 
lations indicate that the relevant time scale is not the amount of time it takes for the black hole to 
evaporate but, instead, the amount of time the dynamics takes to "scramble" the black hole's micro- 
scopic degrees of freedom in such a way that initially localized perturbations become undetectable 
by observables that fail to probe a significant fraction of all the degrees of freedom. While a direct 
calculation of this scrambling time remains out of reach, the relaxation timescales associated with 
classical black holes are incredibly fast. So fast, in fact, that if they also govern the scrambling time, 
then the black hole complementarity principle, one of the guiding principles for many researchers in 
quantum gravity [14, 16, 17] is only just saved from inconsistency - faster scrambling would lead to 
a paradox. 

Motivated by these considerations, as well as the implications of the existence of fast scram- 
blers for the underlying structure of the degrees of freedom of quantum gravity, Sekino and Susskind 
elaborated on the speculations of [15] to formulate the following thiee-pait fast scrambling conjec- 
ture [18, 19]: 

1 . The most rapid scramblers take a time logarithmic in the number of degrees of freedom. 

2. Matrix quantum mechanics (systems whose degrees of freedom are n by n matrices) saturate 
the bound. 

3. Black holes are the fastest scramblers in nature. 

The purpose of this article is to explore the validity of the conjecture, focusing primarily on the first 
part. While the conjecture implicitly refers to the most rapid scramblers in nature, we allow ourselves 
the freedom to investigate the most rapid scramblers in quantum mechanics (and even slightly beyond) 
without worrying if our models are physically realizable. 

Thanks to earlier research in quantum computation by Dankert et al. , it is already known how 
to define a time-dependent Hamiltonian which will scramble in logarithmic time with high probabil- 
ity [20]. The scrambler, however, is a very carefully engineered quantum circuit, so that it is difficult 
to ascribe the fast scrambling specifically to interactions between the constituents as opposed to clever 
tuning of their external knobs. Ideally, therefore, we would like to exhibit a fast scrambler described 
by a simple time-independent Hamiltonian. To that end, we present two examples: 

• Brownian quantum circuits. The scrambler of [20] was a highly structured quantum circuit. 
Other work has studied circuits composed of random gates [21-25] but a rigorous proof that 
they scramble in logarithmic time remains to be found. Instead, we present a continuous-time 
analog of a quantum circuit in which the Hamiltonian is a stochastically varying two-body 
interaction, and prove that it scrambles in logarithmic time. 
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• Ising model. We consider scrambling by the antiferromagnetic Ising interaction on a general 
graph with an external field parallel to the spin quantization axis. Despite its triviality, this 
model nonetheless exhibits a form of weak scrambling in logarithmic time on some graphs. 

The careful reader will have observed that neither of these examples meets all of our criteria for a 
convincing scrambler: the Brownian quantum circuits are time-dependent, if not structured, and the 
Ising model fails to scramble fully. Nonetheless, we feel that, taken together, the examples provide 
substantial evidence that quantum systems with simple time-independent Hamiltonians can scramble 
in logarithmic time. 

The fast scrambling conjecture not only states that logarithmic-time scramblers exist, but also 
asserts that it is impossible to scramble faster. It might seem hopeless to address this question without 
invoking additional physical assumptions beyond just the validity of quantum mechanics. After all, 
scrambling is a form of information propagation, and limits on information propagation normally 
depend on locality. A Hamiltonian allowing all degrees of freedom to interact directly has no locality 
to speak of. Nonetheless, using bounds of Lieb-Robinson-type [26-28] to rigorously control a mean- 
field approximation, we are able to show the following: 

• Subject to some nontrivial norm assumptions on the terms in the Hamiltonian, no physical 
system described by a Hamiltonian with dense two-body interactions can scramble in time 
faster than 0(log n), where n is the number of degrees of freedom.' "Dense" here means that 
the number of interacting pairs of degrees of freedom scales like O(n^). 

• The bound extends to certain four-body Hamiltonians similar to the BFSS matrix model [8]. 

• With more sparsely interacting systems, there is a lower bound of 0{\/logn) on the scrambling 
time. 

While the norm assumptions are unfortunately too stringent to allow us to apply the results rigorously 
to the matrix model Hamiltonian and, thereby, to black hole physics, these results are strong evidence 
that scrambling in less than logarithmic time is impossible. (A related obstacle is our focus on distin- 
guishable degrees of freedom; bosonic degrees of freedom naturally lead to unbounded operators.) 

1.1 Related work 

Asplund, Berenstein and Trancanelli [29] have numerically investigated relaxation in matrix models. 
Their approach is to look at the classical dynamics of the system, with initial states selected stochas- 
tically in such a way as to enforce the uncertainty principle. They do indeed find what appears to be 
very rapid relaxation of the system to an attractor state, but their article only considers a fixed-sized 
and relatively small system, so it cannot directly address the scaling of relaxation time with system 
size. The relationship between this classical relaxation time and quantum mechanical scrambling is 
also an interesting and currently unexplored question. 

^Throughout the article, 0(/(n)) is used in the physicist's sense of "leading order". Readers familiar with asymptotic 
notation should for the most part reinterpret these expressions as 0(/(n)). 
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Barbon and Magan [30] have approached the conjecture from a different direction. They suggest 
that the logarithmic factor in the black hole scrambling time arises from the hyperbolic geometry 
of the so-called "optical metric" ds'^/goo associated to a simple coordinatization of Rindler space. 
Specifically, they argue that the Lyapunov time for a classical billiards game on such a geometry 
agrees with the scrambling time. 

More indirectly, while most work prior to [15] argued that black holes held information for an 
amount of time comparable to the black hole lifetime, if not forever, occasional hints were found that 
information might leak out faster [31]. Reversing the reasoning, one could interpret such arguments 
as evidence in favour of the fast scrambling conjecture. 

The seemingly paradoxical idea that a closed quantum system undergoing unitary dynamics can 
exhibit equilibration or thermalization is an old one dating back, at least, to von Neumann [32]; the 
apparent contradiction with the fact that the global state is pure and never equilibrates is resolved 
by noticing that any small subregion in an interacting closed quantum system generically becomes 
entangled with the rest and may appear, at least locally, thermal. For large systems the recurrence time 
is extremely long so, for all intents and purposes, it is meaningful to say that the system has become 
(locally) thermalized. There is now an enormous literature on this topic (see e.g., [33] for a textbook 
treatment). Recently these old questions have received new impetus from quantum chaos, quantum 
information theory, and many-body physics, all of which have brought new tools to bear [34-45] 
leading to an emerging understanding of the general conditions under which a closed quantum system 
will exhibit (local) thermalization. 

2 Scrambling: definition and properties 

Scrambling is nothing other than a strong form of thermalization applicable to closed system evo- 
lution. A closed system never forgets its initial state, but over time it might become impossible to 
distinguish different initial states without measuring a large fraction of all the system degrees of free- 
dom. The minimum time required for the information about the initial state to be lost is called the 
scrambling time. 

In general, the scrambling time depends on the nature of the set of initial states. For example, 
small perturbations of an equilibrium configuration will generally get scrambled more rapidly than 
will a pair of metastable configurations. Likewise, it could be easier to scramble a discrete set of states 
than all possible superpositions of those states. In this article, we will focus on product initial states, 
but a slightly different formulation will likely be necessary in order to study black hole physics. In 
particular, energy conservation will usually prohibit the strong form of scrambling we demand here.^ 

Suppose that we have a system with n distinguishable degrees of freedom and a Hamiltonian H = 
S(x y) ^{x,y) acting on a Hilbert space T-L = 'Hi(^ (8) • • • (8) T-Ln, where the sum ranges over pairs 
of degrees of freedom. An initial state |'I'(0)) evolves to a state = exp(— zf/'t)|^'(0)). 

For S C {1, 2, . . . , n} a subset of the degrees of freedom and S'^ the complement, let ^'^(t) = 
tr5c|*(i))(^'(i)|. 

general definition of scrambling appropriate to finite temperature will be included in an upcoming revision of this 

article. 
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Ideally, a scrambler will delocalize any information initially localized with respect to the factor- 
ization of T-L into subsystems. We therefore define the scrambling time to be smallest time t such 
that ^'5(t) ~ ^s{i) for all S such that \S\ < nn for some < k < 1/2, and for all initial states 
l^'(O)) and |<1>(0)) that factorize into the form jwi) (8) \t1J2) • • • For concreteness, we will fix 

K = 1/3, but its specific value will not affect our conclusions. 

The scrambling time obviously depends on the normalization of the Hamiltonian. In Sekino and 
Susskind's original formulation, the fast scrambling conjecture was that t^/ j3 > C(/3) log n, where /3 
is the inverse temperature and C is an unspecified function. In much of what follows, we will work 
either far from equilibrium, where /3 is not be well-defined, or near infinite temperature, where it 
doesn't accurately reflect the energy per degree of freedom (which stays finite as /3 — )• in the spin 
models we consider). This leaves a couple of alternatives for a dimensionless measure of scrambling 
time: 

• One can consider the ratio of the amount of time it takes to scramble systems of different 

(k) 

sizes, hopefully cancelling the temperature dependence. Let tl be the scrambling time for 
subsystems of size \S\ < k and set r* = ti"^^^ /ti^\ The revised conjecture is then that > 
0{logn). 

• The Hamiltonians we consider do not have their interactions arranged in a lattice structure. 
Instead, each subsystem S generally participates in a number of interactions growing with n. 
As a second option, one can require that the energy scales extensively with the system size 
n, thereby selecting a normalization for the Hamiltonian which, while coarse, is sufficient to 
determine the scaling of with n. 

The final step in formalizing the notion of scrambling time is to clarify the meaning of "^s{t) — 
^si't)- The trace distance provides a notion of statistical distinguishability that meshes well with 
the quantum information theoretic applications of scrambling. Specifically, one should demand that 
ll^sW - < e where \\X\\^. = tr VX^X. (See, e.g., [46] for a discussion of the statistical 

interpretation of the norm.) 

2.1 Scrambling as entanglement generation 

Scrambling information is by definition just storing that information in complicated correlations be- 
tween many subsystems, which means that scrambling is intimately related to the production of en- 
tanglement. In fact, the concepts are essentially one and the same. Intuitively, the reason is that if the 
restriction "^sii*) of ^ scrambled state is not highly mixed, then there won't be enough room in the 
Hilbert space T-L at time to accommodate all the scrambled states, which contain a basis for Ti. (The 
relationship is simplest when Ti is finite dimensional, which we will assume here but not elsewhere 
in the article.) 

Formalizing that intuition is a simple exercise in quantum information theory. Recall that the 
von Neumann entropy of a density operator p restricted to subsystem A is H{A)p = H{pa) = 
— tr Pa log Pa, and that the mutual information between subsystems A and B for p is defined as 

I{A : B)p = H{A)p + HiB)p - H{AB)p. 



-5- 



Fix an orthonormal product basis {|V'a:i)|V'x2) " " " iV'xn)} fov % = Hi ®'H2 ® ■ ■ ■ Hn- After 
time t^, all of these product states will be scrambled, so consider = exp{—iHt^)\ilJxi) ^ 

• • • "S* iV'xn)- It it convenient to introduce an auxiliary Hilbert space X and consider the following 
density operator on the combined XTi system: 



Pxn 



^ ^ 'xi,...,x„)(xi,...,x„|x®^'^'^^'-'''"^. (2.1) 



Xl,...,X 



The system X records in an orthonormal basis which state describes H, and the overall state is an 
equal mixture over choice of xi, . . . , 

Because subsystem S is scrambled, all of the states ij/^^i'-'^") = tr 5c ^jj^ essen- 

tially indistinguishable, so there can't be any significant correlations between X and S. A quantitative 
way of expressing that fact is that the mutual information I{X : S)p will be small, say less than 6. (A 
standard continuity result implies that 6 can be chosen to be 3e log dim Ti + f{e), where /(e) goes to 
zero with e and is independent of n [47].) 

On the other hand, the states - form an orthonormal basis for 7i, so their equal mixture 

is just the maximally mixed state on T-L. The state is by construction precisely that equal mixture. 
It follows that ps is also maximally mixed and, therefore, that H{S)p = log dim 'Hs'. 

Substituting into the inequality I{X : S) < 6 then gives 

log dim Us-S < H{XS)p - H{X)p. (2.2) 

The quantity on the righthand side, H{XS)p — H{X)p is known as the conditional entropy H{S\X)p 
of S given X. It can be interpreted as the uncertainty remaining in S once X is known and evaluates 
in this case to 

\ X~\ X ] \ X '{ X I 

the average entropy of the states ^ g ' . Inequality (2.2) thus ensures that the states ^ ^ ' "' " 
have high entropy, very close, in fact, to the maximum possible value of log dim ^5. (In the finite 
temperature setting, log dimT-Ls would be replaced by the entropy of the appropriate thermal state on 
S.) 

The entropy of a mixed state on 5 measures how much entanglement there is between S and 5^^ 
in the corresponding pure state. Good scrambling can therefore only be achieved by a time evolution 
that produces nearly maximal entanglement, and vice versa. 



3 Brownian quantum circuits 

A quantum circuit is an idealized model of the time evolution of a quantum computer, which is gener- 
ally assumed to consist of a number of qubits. At a given discrete time step, a collection of "gates" is 
applied to the state, where a gate is a unitary transformation involving one or two qubits. Each qubit 
participates in at most one gate per time step. 
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As mentioned earlier, Dankert et al. found a quantum circuit that scrambles n qubits after 
O(logn) time steps [20]. Their circuit, however, is quite an intricate construction that doesn't plau- 
sibly model any naturally occurring interactions. Other researchers have studied random quantum 
circuits, establishing that they are scramblers, but the question of whether they scramble in time 
O(logn) remains open [21-25]. 

In this section, we study a continuous-time analog of a random quantum circuit, which provably 
does scramble in time O(logn). Consider n qubits interacting according to a stochastically varying 
Hamiltonian. Time is subdivided into steps of length e = Ai and during a given time step, the 
interaction between each pair of qubits is given by a random Wigner matrix. More formally, the 
Hamiltonian from time = r At to t^+i = {r + 1) At is given by 

3 

^r- = ^ CT;^ ® AS,,,-fc,„„«,, (3.1) 

where the ABrj^k,aj,ak independent and identically chosen real Gaussians A''(0,e^) with zero 
mean and variance . The operator fi"^ represents the Pauli operator a"^ acting on qubit j, with 
the identity matrix. 

The time evolution from to to t^ is given by 

exp(-iFr-iAt) exp(-ii/,._2At) • • • exp{-iHoAt). (3.2) 

For this process to have a well-defined and nontrivial limit as At — 0, one must choose oc 
(At)^^ [48]. That is, the strength of the interactions must increase as the size of the time steps 
decreases. This requirement makes it problematic to interpret t^: in units of energy. Instead, we show 
that the ratio r* = tl'""^/tl^^ = O(logn) for constant < k < 1/2. More generally, the ratio of the 
time required to scramble systems of size k to the time required to scramble a single qubit scales like 
0(logA;). 

The limiting dynamics of the random Hamiltonian evolution is given by U{0) = I and U{t + 
dt) = ex.p{idG{t))U{t) for 



(3.3) 



where the dBj^k,aj,aj,{t) are independent Brownian motions with unit variance per unit time. Since 
we are only interested in r*, the normalization factor is of no real consequence; it is chosen such that 

\\dG{t)g = dt. 

Calculating using the Ito calculus (see [49] for an accessible introduction) leads to the following 
stochastic differential equation for U (t): 

n 3 -. 

dU{t) = -=^== y^ E ^j' ^<'^I\{mU(i)dB^.,-,it)-^Uit)dt. (3.4) 
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(In a slight abuse of notation, we henceforth write dBa^^akit) '■= dBj^k aj,aki't)- ^\{j,k} denotes the 
identity on all sites except for i and j.) Suppose we have some initial state |\I'(0)). Then the state 
qf(t) = U{t)\^{0)){^{0)\W{t) undergoes the dynamics 



3 

I 



j<k Q!j,afc=0 

^^4^)Y. E {^?^<'^h{m)^(i){^?^<'^h{m)di- (3-5) 

j<k aj,ak=0 

The time evolution will have scrambled subsystem 5 once ^'^(t) is independent of the initial 
state, as measured by the trace distance as discussed in Section 2. Equivalently, ^'^(t) should ap- 
proach a fixed state independent of ^(0). In the case of Brownian circuits, that fixed state is close 
to maximally mixed provided S is not too large. Rather than calculating ~ ^s/ dimT-LsWi 

directly, it is much easier to evaluate 

W^sit) - Is/dim-HsWl = tv^sitf - ^r^. (3.6) 

An application of Cauchy-Schwarz ensures that if tr < (1 + e^)/dimns, then - 

Is/ dim ^5 111 < e, as required for scrambling. 

We therefore introduce the purity of a subsystem S: 

hs{t) = tri^sitf)- (3.7) 

The equation of motion for the purity hs{t) is given by 

dhsit) = 2 tj:ips{t)dpsit)) + tvi{dpsit)f). (3.8) 

After some algebra, it is shown in Appendix A that (3.8) gives the following dynamics for the purity 
averaged over realizations of the Brownian motion, /15 = [hs] : 

n(n - 1)^^ = 2\S^\Y,hs\m{t) + 2 (|5^|(|S^| - 1) + |5|(|S| - 1) - n(n - l))hs{t)+ 



21-51 ^ hsu{k} - E ^s\{j}u{k}- (3-9) 



keS" jes 

kGS" 

Here 1^1 means log dim A. If the initial configuration ^'(0) consists of a pure product state, then hs 
depends only on l^l = k, so the system of ODE's collapses to a tridiagonal system and can be written 
in the form _ 

The rough features of the system (3.10) are sketched in Figure 1 and the system's behavior is studied 
in Appendix B, with the conclusion that the ratio scrambling time 

n=trAi~logn. (3.11) 
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Figure 1. Schematic plot of the decay of the average purity hic{t) of a subsystem S of size k. When the initial 
state is a pure product state all purities begin equal to one. The scrambling time for a system of size k is defined 
as the amount of time required before purity of subsystems of size k becomes less than (1 + 6)2^'^; a purity of 
exactly 2^*^ corresponds to the maximally mixed state. For subsystems of size smaller than n/2, the dynamics 
ensures that larger systems have smaller purities, a property not necessarily true of general entangled states. 



4 Ising interaction on random graphs 

There is an inherent difficulty in searching for fast scramblers: the intuition that a given system will 
rapidly scramble information is usually based on a sense that the dynamics is complicated, which is 
almost invariably an obstacle to studying the details of the system's time evolution. Complexity is not 
an absolute requirement, however. In this section, we will see that one of the simplest conceivable 
quantum mechanical systems has lessons to teach us about scrambling time. 

Let G = {V, E) be an undirected graph. Assign a spin-^ to each vertex v and allow spins 
adjacent with respect to the edge set E to interact via the antiferromagnetic Ising Hamiltonian 

^ = S E \{I-<)®{I-<) (4.1) 

{u,v}eE 

as illustrated in Figure 2. The normalization factor is chosen to ensure that the energy per 

spin scales extensively with the system size, n = \V\, as discussed in Section 2. Choosing lO^) and 
I Iz) to be the +1 and —1 eigenstates of a^, the Hamiltonian can be written more simply as 

^ = r^\ Yl ii'>(ii«»m(ii.- (4-2) 

The system obviously can't scramble because any product state of the form |if)|i|) • • • is an 
eigenstate of H. Local information encoded in that basis remains locally accessible for all times. On 
the other hand, information in the conjugate basis of cr^ eigenstates, |0^) and potentially has 
more interesting behavior. Suppose then that the initial state is |^'(0)) = 1^1)^2) " " " Kn)- 

The system is periodic with period 27r|i?|/|F| and the state at time t is most entangled at 

time tent = 7r|-E'|/|^|- The state |^'(tcnt)) is known as a. graph state in quantum computation, where it 
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V 



Figure 2. Antiferromagnetic Ising interaction on an undirected graph G — (V, E). There is term Hi^^ ^^ in the 
Hamiltonian for each edge (u, v) E E of the graph. Generic sparse graphs with average vertex degree roughly 
log \V\ will quickly scramble information stored in the simultaneous {crj : v e V} eigenbasis. 

plays a central role in the measurement-based quantum computing architecture [50, 51]. For a subset 
S <^V of spins, the entanglement entropy of the density operator ^'^(tent) = tr^c |^'(tent))(^I'(ient)| 
has a simple formula in terms of the submatrix Adj^ of the adjacency matrix of G that selects the 
rows of S and the columns of S'^ [52] : 



where the entropy is measured in bits. It follows that if Adj5 has full rank as a matrix over Z2, then 
the entanglement is |5| bits. The only density operator with IS*! bits of entropy on |5| qubits, however, 
is the maximally mixed density operator. Therefore, if Adj^ has rank the final density operator 
on S will be independent of the choice of initial state |^(0)) = |if ) • • • That is, the system will 
have scrambled the eigenstates. 

Each edge from S to S'^ contributes a nonzero entry to Adj^, but formula (4.3) implies that too 
many connections can reduce entanglement. For example, for the fully connected graph, every row 
of Adjg is just a sequence of ones, so there is never more than one bit of entanglement entropy. To 
maximize the entanglement between S and S"^, one needs the matrix Adj^ to have full rank for all 
\S\ < n/2. This is generically the case for appropriate random graphs in which edges are included 
randomly and independently in G according to the rule Pr[(u, v) € E] = p. 

Since ient = vr|£'|/n, minimizing tent requires minimizing the expected number of edges in the 
graph, which is {^)p, subject to the constraint that the rank of Adj^ be maximal for all \S\ < n/2. As 
n goes to infinity and |^ goes to any constant a, the rank defect of the matrix is Poisson distributed 
with parameter ae"'*' provided (logn + 7)/n <p< 1 — (logn + 7)/n [53]. Therefore, Adj^willbe 
full rank with probability at least 1 — e~'^. Thus, the minimal value of tent is equal to 7r(logn + 7), 
where 7 can be regarded as a constant. 

Even though the system doesn't scramble fully in the sense of making all local information locally 
inaccessible, it does scramble the basis of cr^ eigenstates and does so in time logarithmic in n, as 
required of a fast scrambler. 



Si'^sitcnt)) = rankza Adj^ 



(4.3) 
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For the sake of comparison with the Brownian circuit model, it is also instructive to consider 
the analog of , the ratio of the amount of time to scramble systems of size ku to the time required 
to scramble a single qubit. Since the system is exactly solvable, it is straightforward to establish by 
direct calculation that for S = {j} a singleton, the Hamiltonian (4.2) and initial state |^'(0)) imply 



where dj is the number of graph neighbors of site j. The expected number of neighbors per site is 
p{n — 1). Requiring that (4.4) be close to minimal, i.e. |, gives the 1-scrambling time as 0{y/pn) = 
0{y/\ogn). The ratio of the times required for scrambling eigenstates therefore scales like 0{^J\ogn) = 
0(log n/ yjlogn). This hints at the possibility that for systems that do scramble all product states, 
unlike this Ising model, n might also fail obey an 17 (log n) lower bound as required by the fast 
scrambling conjecture. 

Regardless, the Ising model provides an example of a system capable of producing large scale 
multipartite entanglement sufficient to scramble all information stored locally in a fixed basis on a 
time scale no more than logarithmic with the number of degrees of freedom. 

5 Lower bounds on the scrambling time 

One way to prove lower bounds on the scrambling time is to exploit the connection between scram- 
bling and signalling. In particular, scrambling a subsystem S implies the ability to signal to the 
complementary subsystem S^. The main task of this section is therefore to prove signalling bounds, 
but we must do so without relying on relativity or, more generally, any underlying geometry in the 
organization of the degrees of freedom. Our technique goes back to Lieb and Robinson [26], who 
proved bounds on commutators [OA{t), Ob] for observables Oa and Ob localized on subsystems A 
and B of lattice spin systems. To signal reliably from B to A, there must be normalized observ- 
ables for which the norm of the commutator is 0(1). Hastings improved the original Lieb-Robinson 
technique so as to produce dimension-independent bounds [54] and Nachtergaele-Sims showed how 
to adapt it to general graphs [27]. The version we start from combines both features and is due to 
Hastings and Koma [28]. 

As we will see, the Lieb-Robinson technique gives lower bounds on the time required to signal 
from B to A provided A and B are both constant-sized subsystems. The definition of scrambling 
used in this paper, however, only imphes signalling from a constant- sized B to the complementary 
subsystem S'^, and S'^ will generally involve at least half the degrees of freedom in the whole system. 
To deal with this large S^, we use the Lieb-Robinson bound to show that a mean-field approximation 
to the time evolution remains reasonably good for sufficiently short times, provided the initial state 
has product form. For as long as the mean-field approximation holds, the dynamics cannot generate 
any significant entanglement, which prohibits signalling to S'^ and, of course, scrambling. 

5.1 Scrambling implies signalling 

Any information initially stored as a state onl-Li will have become inaccessible to measurements on 
S alone once scrambling has occurred. One way of phrasing this mathematically is by introducing 




(4.4) 
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a "reference" system N that does not participate in the interaction and will initially be entangled 
with system T-Li. The scrambling condition ensures that if the initial state has the form |^'(0)) = 
iV'i) NHi ^ |V'2)'H2 "SD ■ ■ ■ then the time evolution destroys any entanglement between N and 

^1 in the sense that 

W^NsiU) - *7v(0) » ^s{t*)\\i < erank^-ivW. (5.1) 

(See, e.g., Lemma 19 of [55].) To study signalling of a single bit's worth of information, it suffices to 
let \'4^i)Nni = ~^{\^)N\0)ni + |l)Af|l)wi) for a some orthonormal states |0) and |1). 

As discussed in [15, 56], inequality (5.1) implies that the entanglement with N can be recovered 
without use of the degrees of freedom of S. That means there is a unitary transformation V on 5^ and 
a qubit subsystem M of S'^ such that 



|^)7VM>l-2e (5.2) 



for the maximally entangled state |^) = :^(|00) + The ability to send entanglement to S"^ in 

this way is at least as strong as mere signalling, however. Working from (5.2), standard manipulations 
imply that if "Hi were prepared in one of two orthogonal initial states lO)-^^ and \ l)ni then there are 
orthogonal projectors Ho and IIi on S'^ such that 



^ trno^'l°i(t,) + ^ trHi^-l^iCt,) > 1 - 4e, (5.3) 



where |^'(-'^(0)) = \j)y,i '^^ |V'2)'H2 ® " " " ® |V'n)'H„- That is, the signal has been transmitted from 
T-Li to S'^ with an average probability of error in the decoding of at most 4e. These conclusions are 
illustrated in Figure 3. 

5.2 Lieb-Robinson bounds for nonlocal interactions 

As has been the case throughout the paper, the state space will have the form T-L = Hi ^ ■ ■ ■ %n- 
Suppose that the Hamiltonian has the two-body form H = ^{x,y)^ where the sum is over un- 

ordered pairs of sites {x, y) and each of x and y range from 1 to n. Each term H(^^^y^ acts only on 
fix ® T-Ly. We can associate to such a Hamiltonian an interaction graph G = {V,E) with n vertices 
representing Hilbert spaces Tii, . . . and edges connecting vertices x and y if the 2-body inter- 
action term H^^^^y^ is nonzero. The antiferromagnetic Ising interactions discussed in Section 4 are a 
special case, and the graph of Figure 2 is, of course, the interaction graph. Denote by D the maximum 
degree of any vertex in the interaction graph. Let us further require the constraint \\H(^rc,y) II < c/L* on 
the strength of pairwise interactions for some constant c. Physically, this constraint ensures that the 
energy per degree of freedom will remain finite for all states even in the limit n — )• oo. 

For X C {l,2,...,n}, denote by Ax the algebra of bounded norm operators acting on fix- We 
start by discretizing time into steps of size e = t/N for some large integer and let tj = je. Then, 
for observables Oa G Aa and Ob ^ Ab, 

N-l 

[OA{t),OB] = [Oa, Ob] + {[OA{tj+i), Ob] - [0^(t,), Ob]) ■ (5.4) 
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Figure 3. Scrambling implies signalling. Site 1 is prepared in one of two orthogonal states \ j) for j either or 
1. All other sites are prepared in states that are independent of j. After the scrambling time any subsystem S 
of size at most nn will be essentially independent of j, but the reduced states (t*) on the complementary 
subsystem 5^ will be nearly orthogonal. Scrambhng therefore implies signalling from the first site to the 
complementary system S*^. 



The observable Oa evolves after time e to Oyi(e) = e*'*^ Oa e ^'"^ + O(e^), with 

The norm of each term of the sum in (5.4) can be bounded from above using 

\\[OA{t,+i),OB]\\ = \\[e"^^OAe-''^OB{-tM+0{e') 

< \\[0A,0B{-tM + 4[0A,[h,0B{-tM + 0{e''). 

Hence, we have 

N-l 

\\[OA{t),OB]\\ < ||[OA,OB]||+2e||OA|| J2 \\[h,OB{-tM+0{e). (5.5) 

j=0 

In the limit e — )• 0, the above expression becomes the inequality 

\\[OA{t),OB]\\ < \\[Oa,Ob]\\+2\\Oa\\Y,Y1 f ds\\[H^,,,){s),OB]\\. (5.6) 

We now specialize to the case where B is the singleton set {y}. Fixing attention on a particular 
Ob G Ay, define 

[Ox{t),OB]\\ 



CB{X,t)= sup 

Ox&Ax 



\0x\ 



(5.7) 
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Figure 4. Proving the Lieb-Robinson bound on a graph involves a sum over pairs of vertices that contain 
paths between x and y. Starting with a set of edges, paths can be visuaHzed for the purpose of counting as 
different ways of identifying vertices in successive edges of a sequence. For example, in the figure, each bubble 
represents an edge and the blue lines indicate the identified vertices: 3 and = y. There 

is therefore apath with the following edges: {x, z), {z, z[), {z, Z2), (22, y) and (y, Z4). 

If the subsystem X in the inequality (5.6) is A, we have 

CB(At) <Cb(A0) + 2 V V f dsCB{{x,z),s)\\H^,^,)\\, (5.8) 

xeA z -Jo 

whereas for X = {x,z) v/e. obtain 



CB{{x,z),t)<CB{{x,z),0) + 2 V f dsCB{{zi,z[),s)\\H(^,^,,,J. (5. 

, Jo 



9) 



(x,z)n(zi,z[)^fll 



By using the above bound iteratively in (5.8), we find 

CB{A,t)<CB{A,0) + 2Y,Yl f dsCB{{x,z)M\H^x,z)\\ 

+ 4V V f\s f d~sCB{{zi,z[)M\H^z,,z[)\\\\Hi^x,z)\\ + --- i^-m 



x^A z,zi,z' 



1- 

{x,z)n{zi,z[)y^(D 



By definition, at time t = 0, the function Cb{{z, z'), 0) is zero unless z = y ox z' = y. Moreover, 
from the definition in (5.7), it is clear that Cb((-2;, y), 0) < 2||Ob||. Thus, 

Cb{A, t) < {2t) 2\A\ \\Ob\\ ||^(.,.) II + ^ 4|^| IIObII Y1 W^i-^^v) H H 



z,zr. 
(a:,2>n(2i,s/>^0 



+ ^4|^|||Ob|| ||%,,,>||||%„4>||||i^^(x,.>ll + - (5.11) 

z,zi,z'^,Z2: 
(a;,2>n(2i,4>^0 
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On a graph of maximum vertex degree D, the i sum in the right hand side of (5. 1 1) has at most 
4(4£))*"^ terms, which can be seen by a simple combinatorial argument. The sums that appear have 
the form 

E 

z,zi,z[,...,Zi^i: 
(a;,2;>n(2i,4>7^0 

One can think of terms in the above sum as paths made from edges that connect y and a; G ^, as 
illustrated in Figure 4. A path is made by identifying a vertex in each pair {zj, z'j) with a vertex in 
{zj+i, z'-^^) . Once a vertex Zj is identified with some -Zj+i , there is a maximum of D different choices 
for z'-^-^^ because the interaction graph has maximum degree D. The path starts either at x or z and 
ends either at or y. For each of these cases, it is not hard to see that the number of paths of length 
i is less than (4D)*^^. Therefore, the overall number of terms in the sum (5.12) is always less than 

Moreover, from the constraint || < c/D on the strength of two-body interactions, it 

follows that each term is bounded above by {c/Dy. Therefore, 

Cs{At)<4\A\\\OM\f:^^{^yHm'~' < MMeS- (5.13) 

i=l 

Finally, note that the arguments of this section are not restricted to the two-body case. Appendix C 
shows, for example, that a very similar bound holds for Hamiltonians structured like that of the BFSS 
matrix model. 

If it were possible to signal from i? to yl in time tsignah then there would exist unit norm operators 
Oa e Aa and Ob G Ab such that (^(0)|[OA(i), Oi?]|^(0)) > 6 for some 5 = 0(1). A direct 
application of (5.13) then implies that 

t..,naZ>^log(|^). (5.14) 

In the case of a fully connected graph, D = n — 1, which would seem to force logarithmic scaling 
of the signalling and, therefore, of the scrambling time. Unfortunately, as discussed in Section 5.1, 
scrambling only implies signalling to S'^ so we must take A = S"^, and systems of size larger than 
n/2 don't scramble, so \S^\ > n/2. Naive substitution into (5.14) then yields no bound at all on the 
scrambling time so further analysis will be necessary. 

5.3 Scrambling highly mixed initial states 

It's interesting to note that (5.14) does yield a logarithmic lower bound for the type of scrambling 
relevant to information retrieval from highly entangled black holes. This paper has thus far focused 
exclusively on pure initial states for Ti. Replacing |^'(0)) with a state pure on Hi and maximally 
mixed on ^2 through Tin corresponds to a different communication scenario. The retrieval of the 
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Figure 5. Scrambling implies signalling for mixed initial states. Site 1 is prepared in one of two orthogonal 
states |j) for j either or 1, and all other states are prepared in states that are independent of j and highly 
mixed. These mixed states can be viewed as parts of pure states that are entangled with environmental degrees 
of freedom E2 through £'„. When the initial states are maximally mixed, it is possible to scramble subsystems 
S of size n — 0(1). This leads to signalling to the complementary degrees of freedom 5^, adjoined with the 
environmental degrees of freedom E — E2 ■ ■ ■ En- That is, the states 4*50^ (i*) are nearly orthogonal to each 
other. Because S"^ can be taken to be constant-sized, the Lieb-Robinson bound provides nontrivial lower bounds 
on the signalling, and hence scrambling, time in this setting without the need for additional argument. 



information stored in T-Li would need to make use of some degrees of freedom C {1, 2, ... n} 
supplemented by the environmental degrees of freedom required to "purify" the initial state. When 
the initial state is so highly mixed, however, it is possible to scramble many more degrees of freedom 
than when the initial state is pure, leading to a much smaller S'^. The resulting signalling scenario 
is illustrated in Figure 5. Brownian quantum circuits, for example, will scramble subsystems S of 
size n — 0(1), leaving a constant-sized complementary system S"^ with \S'^\ = 0(1). Because the 
environmental degrees of freedom don't participate in the interaction, one can take \A\ = = 
0(1) and recover the logarithmic lower bound on scrambling from (5.14). Moreover, it is necessary 
to consider these larger systems: numerical investigations show that it is possible to scramble any 
constant fraction of the degrees of freedom in constant time if the initial state is highly mixed. 

5.4 Controlled mean-field approximation via Lieb-Robinson 

Having proven the Lieb-Robinson bound, we now prove that up to times of order log(D), the reduced 
density matrix on each site x is close to a pure state. Since scrambling requires entanglement, this will 
provide the desired lower bound on the scrambling time. Since D is the maximum vertex degree, this 
evaluates to an order log(n) lower bound for Hamiltonians in which every degree of freedom interacts 
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with a constant fraction of all the others. 

A slightly subtle point is that all of a system's single site density operators can in principle be 
close to pure even if the wavefunction of the whole system is not. The issue is that the number of sites, 
n, is large, and the overlap of the true wavefunction with the mean-field pure product state can easily 
be a factor exponentially smaller in n than the corresponding single-site overlap. The analysis of this 
section will therefore not imply that the wavefunction of the whole system is product up to times of 
order \og{D). 

We begin by defining a time-dependent "mean-field" Hamiltonian 

H^^{t) = Y,Hnt), (5.15) 

X 

where each operator is supported on site x. We define the operators i^^^^ self-consistently 

as follows. Let ^'^^^(t) be the reduced density matrix on site x at time t assuming that the state 
is initialized to a product state |^'(0)) = \ipi)ni ' " \'^n)Hn at time t = and evolves under 
Hamiltonian H^^. Then 

dt^^^it) = ^(t), *f ^(i)]. (5.16) 

We then define 

Hnt) = ^tr,(if^,,,)<^(i)). (5.17) 

y 

This provides the self-consistent definition of H^^. We also define 

<S=tr,(i?(.,,)<^(i)), (5.18) 

sothati7f^ = E,<J. 

Define ^^(0 to reduced density matrix of the state evolving under Hamiltonian H = 

^(x y) ^{x,y) again assuming that the state is initialized to the product state |*(0)) at time t = 0. We 
now prove that, for t small compared to log(7T,), ^x{t) is close (in trace norm distance) to *^^(f). 

For notational convenience, we will write (O) to indicate (^'(0)|O|*(0)). Further, we define a 
unitary U^^(t) to define the mean-field evolution on site x by 



[/^^^^(0) = /, (5.19) 

and 

dtU^^^it) = -iH^^{t)U^^{t). (5.20) 

Also, define 

U^^{t,s) = U^^{t)U^^{s)^. (5.21) 

That is, U,^^{t, s) describes mean-field evolution from times s to time t. Since the mean-field time 
evolution on all of T-L for time t has the form it can never generate any entanglement 

between different sites. For as long as it remains a decent approximation to the true time evolution, 
scrambUng will be impossible. 
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Similarly, we define U{t) to be the unitary describing evolution under H, with 

U{0) = I, 



(5.22) 



and 



dtU{t) = -iHU{t). (5.23) 

Define C/(s, i) = U{s)U{t)^. 

In proving the Lieb-Robinson bound above in Section 5.2, we used the Heisenberg notation 
for operator evolution: 0{t) denoted U{t)^OU{t). In this section, we will not use this Heisenberg 
notation, and we will instead explicitly write out U{t) or f/(t)^ to describe evolution of operators or 
states. The reason for this is that we are going to evaluate the expectation values of operators whose 
time-dependence is not necessarily given by conjugation by U{t), so that the parenthetical (t) could 
be ambiguous if we were to use it to denote Heisenberg evolution. 

Consider any operator Ox supported on site x. For any two times, ti and tf,we have 



+ i 



I ds U{s, i,)t [(^H - H^'^is)) , t/f ^(t/, s)^OxU^'^itf, s)] U{s, U). (5.24) 

This equation can be proven by differentiating the right-hand side with respect to t,; and verifying that 
the result is equal to the right-hand side multiplied by i and commuted with H. Call the first and 
second terms on the right-hand side Ti and T2 respectively. When Ti is differentiated with respect to 
ti the result is i[H^^^{ti),Ti], while differentiating T2 with respect to ti gives two terms, one from 
the change in the limit of the integral (this term is equal to i[H — H^^^{ti), Ti] and adding this to 
the derivative of Ti respect to ti gives i[Ti, H]) and one term from the change in U{s,ti) which gives 
i[T2, H]. Specializing to ti = 0, we have 



H - H^'^'is)) , U^'^it, s)^OxU^'^{t, s)] U{s). (5.25) 



U{t)^OxU{t) = U.^'^^{tyOxU^'^{t) 

ct 

+ i I dsU{s)^ 



We will apply this equation to the specific case of the time-dependent operator Ox = ^ 
using it to compute the expectation value 



(^(t)t(i - ^^^^{t))u{t)) = 1 - {^r{t)\^x{mr{t)) 



and show that ^x{t) is close to 

Note that for any operator O supported on x, we have 



h-h!^^^{s),o 



y 



ttMF ri 



(5.26) 



(5.27) 



This holds in particular in the case that O = Ux^^{t, s)^OxU^^{t, s) as in Eq. (5.25). For any given 
y, the trace tvy{'^^^ {s){H(^x,y) - ^{xy))) = 0- ^^^^ ^^^^ that ^'^'^(s) is a projector for all y and s. 
Write 

L{x,y){s) + R{x,y){s), (5.28) 



{x,y) 
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where 

L^^^y^is) = (1 - - (5.29) 

and i?(^^j^)(s) is defined by Eq. (5.28). Then, ^'^^^(s)L(^. (s) = and similarly (5)^*^-^(5) = 
0. 

Taking into account Eq. (5.27) as well as the definitions of L and R, we can replace 
in Eq. (5.25) with a sum over y of L(^rc^y){s) + R(^x,y){s). This gives a sum over y of a sum of two 
terms (the L and R terms). Consider an L term for given y, s. This is 

U{sy[L^,^y) (s), U^'^it, s^O^U^'^it, s)]U{s) (5.30) 

= (c/(.)t(l-M/f^(.))f/(.)) 

X (c/(.)t[L<,,,)(.), C/f ^(t, s)tO,.t/f ^(t, s)]C/(s)) . 

Similarly, for an R term, we write 

U{s)^[R(^.,,y){s),U.f''{t,s)^O^U^'''{t,s)]U{s) (5.31) 
= C/(s)t[L<,,^^(s),;7f^(t,s)t0.t/f^(t,.)](l-^.f^(s))C/(s) 

= (u{s)nL^,^y){s),U^^{t,s)^O.Unt,s)]U{s)) 

For an L term, we apply Eq. (5.25) to the first term (jj (s)^^(l — ^'^^^(s))C/ (s)^ on the last line of 

Eq. (5.30), while for an R term, we apply Eq. (5.25) to the last term (u{s)'^{l - ^'^^^(s))C/(s)^ on 
the last line of Eq. (5.31). 

We proceed iteratively in this fashion, getting an infinite series of terms. Each term in the series 
at a given order, say the A;-th order, involves a fc-fold integral over si, S2, • • • , Sfc, with < si < • • • < 
Sk < t. Further, each term in the series has a sum over k different sites yi,y2, ■ ■ ■ ,yk and finally each 
term has a sum over k different choices of L or terms. Our goal is to bound the expectation of the 
sum of terms at fc-th order. Each such term will have one operator 1 — "^y^{sk) in it. This operator 
may be in the middle of a sequence of terms. Suppose the last term was an L term. Then we have 
some expectation value 

(p(c/^^^(.)t(l - ^'yi^{sW^{s))Q) (5.32) 

for some operators P,Q. We commute (u^^^{s)'^{l - ^'^^^(s))C/*^^(s)^ through P using the 
Lieb-Robinson bounds above. Note that the reason that we choose to commute through P rather than 
through Q is that whenever the last term is an L term, one of the operators in Q is L(^y^__^^y^y We 
would not be able to bound the associated commutator since Q has support on y^.. Conversely, if the 
last term was an R term, we commute to the right through Q instead. Note that 

{{u^'^is)Hl - ^^i^{sW^{s))s) = (s([/^'^^(.)t(l - ^^'JisW^is))) = (5.33) 
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for any operator S. Therefore, the expectation value Eq. (5.32) is bounded by the commutator 
(u'^^^{s)^l - ^yl^is))U^^{sfj]\\ in the case that the last term was an L term. (Similarly, it 
is bounded by a commutator with Q in the case of an R term.) 

To bound this commutator, we consider two different cases. First, there is the case that yk yi for 
1 < i < A:. In this case, we can bound the commutator by (const. /Z))^ x (k/D) x exp(const. x t) 
using the Lieb-Robinson bound from Section 5.2, which contributes a factor of const, x {l/D) x 
exp(const. x t). The factor of k appears because P is a product of up to k different operators while 
the final factor of (const. /D)'^ comes from the fact the norms of all of the operators -Z>(x,3/) and R{x,y) 
are bounded above by const / D. The case when yj^ = yi for some 1 < i < k might seem to be more 
problematic because the Lieb-Robinson bound doesn't apply, but we will see below that this bad case 
happens infrequently enough to not affect the final conclusion. 

To bound the sum over terms in the series at given order, we note that the sum over choices of 
yi,. . . ,yk decomposes into these same two cases. The sum in the first case, when y^ / y-i for all 
1 < i < k, is bounded by 

(A^ \ ( const X 
— I — exp(const. x t), (5.34) 
D J k\ 

where the factor of (k/D) exp(const. x t) is due to the commutator bound, with the factor of l/D'^ 
that was present there cancelled by an in the numerator arising from the sum over yi , . . . , t/^. The 
factor of t''/k\ in Eq. (5.34) arises from integrating over the k different times < si < • • • < < t. 
Summing over the different choices of L or i? contributes an extra factor of 2^ which can be absorbed 
into the constant raised to the power k. In the second case, when y^ = yi for at least one 1 < i < k, 
the sum over yi is bounded by const, x (k/D) x ('^"'^'"t.xf) ^j^^j.^ ^j^g factor of k/D arises because 
any of the k — 1 different ?/j for 1 < i < /c may be equal to yk- (By constraining the choice of yi we 
reduce the number of different choices for yi in the sum.) 
So, the sum over all orders k is bounded by 



k \ (const. X t) 



k 



oo 

const. X [j)]^ ~ ~ ^ 6xp (const. 

k=l ^ ^ 

< const. X (l/D) X exp(const. x t). (5.35) 

Recall that this is an upper bound on the quantity 1 - (^'^^(t)|^'^(f)|*^^(t)), the deviation 
of {t) from being a pure state. If the deviation is small at time t, the continuity of the von Neu- 
mann entropy implies that H{"i^x{t)) < 5 log dim "Hj^ for some universal 6 going to zero with the 
deviation [47]. The subadditivity property of H then implies that 

Hi^sit)) < Y,H{^x{t)) < <51ogdim-?^5. (5.36) 

As discussed in Section 2.1, scrambling requires that H{^s{t)) be close to its maximal value of 
log dim ^5, which can only occur if the deviation of each ^'^ (t) is significant. For this to happen, 
(5.35) requires that t be order log(L'), which is the desired lower bound on the scrambling time 
provided D ^ n. (Note that it is equally possible, if slightly more technical, to supply a dimension- 
independent argument.) 
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5.5 Sparse graphs 

If the degree D is constant or even scaling sublinearly with n, then (5.35) might not be a useful 
bound. For sufficiently slowly growing D, however, it is possible to substitute the more traditional 
Lieb-Robinson bound for the version proved in Section 5.2. Specifically, the version of the bound 
proved in [28] ensures that 



for some positive constants v and ^. The function d{A, B) measures the distance from A to i? in 
the interaction graph so the interpretation of (5.37) is that there is a maximum effective velocity v of 
information propagation between degrees of freedom. For complete graphs, the bound is trivial, but 
not for graphs of lower connectivity. 

In particular, there can be at most vertices at distance exactly I from any fixed vertex. It 
follows that at most a fraction a of all pairs of vertices x and y can satisfy d{x, y) < log(an) / log D. 
Therefore, most x and y satisfy d{x,y) > 0{logn/ logD). Substituting into (5.37) and comparing 
with (5.13) implies that the signalling time between x and y must satisfy 



For regular graphs, in which every vertex has degree D, this reasoning can even be extended to 
the scrambling time t*. From the mean-field argument, we already know that > 0(log D). A direct 
application of Lieb-Robinson, however, requires that > 0(log n/ log D). To see this, fix x and let 

5 be the set of all sites y such that d{x,y) < logn/log(Z) — 1) + const. This will be a constant 
fraction of all the sites. Different initial states at site x are eigenstates of rank one projectors acting 
on that site. By a standard argument [54], (5.37) ensures that for times t < d{x, S^)/v — const., the 
time-evolved projectors will be well-approximated by operators acting only on S, in which case the 
different initial states can be distinguished by measurements on S alone, which is inconsistent with 
scrambling. Optimizing over D as in (5.38) yields t^: > 0{\/log n). 

6 Conclusions 

We have explored two aspects of the fast scrambling conjecture, both of which are implicit in the 
statement that the most rapid scramblers take a time logarithmic in the number of degrees of freedom. 
For the statement to be true, there must exist systems scrambling quickly enough to saturate the bound. 
Conversely, no system should be capable of scrambling in time faster than logarithmic. 

We demonstrated that Brownian quantum circuits and the Ising model on sparse random graphs 
both scramble information in logarithmic time. Each example, however, has its own deficiencies, not 
quite meeting the objective of finding a time-independent Hamiltonian that scrambles all locally avail- 
able information in logarithmic time. Namely, Brownian quantum circuits are not actually described 
by a time-independent Hamiltonian, while the Ising model only scrambles information in one ba- 
sis, leaving the conjugate basis invariant. Nonetheless, the examples illustrate that the entanglement 



[OaW, Ob] < const, x exp [{vt - d{A, B)) /^]\\Oa\\\\Ob 



(5.37) 




(5.38) 
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creation required for scrambling can indeed be accomplished in logarithmic time without the need 
for an intricately structured Hamiltonian. Finding a completely fast scrambling time-independent 
Hamiltonian remains an open problem. While it's simple enough to write down plausible candidates, 
analyzing them is a challenge. 

To find limits on scrambling, we used Lieb-Robinson techniques to prove a general lower bound 
on the scrambling time of arbitrary quantum systems with two-body interactions. The strategy was 
to estimate the amount of time required to signal in such systems, which in turn bounds the amount 
of time required to scramble. Mathematically, we used a modified Lieb-Robinson bound to argue 
that for sufficiently small times, a mean-field approximation to the single-site evolution is a good 
approximation. If most pairs of systems interact with terms of comparable norm in the Hamiltonian, 
the result is a logarithmic lower bound on the scrambling time. The same bound applies to four- 
body Hamiltonians with structure similar to the BFSS matrix model. However, our argument does 
contain a loophole: in the general case of graphs with lower coimectivity, we could only prove a 
requirement that the scrambUng time be at least 0{^/\ogn), although we strongly suspect that this is 
only a reflection of the limitations of our technique. 

One of the lessons of this investigation is that some plausible mathematical formulations of the 
conjecture are false. In the case of the Ising model, for example, the scrambling time ratio = 
t^/t^\ which a priori one might have thought should also grow at least logarithmically with the 
number of degrees of freedom, is parametrically smaller. More subtly, the fast scrambling conjecture 
is formulated in terms of pure initial states and scrambling sets S of size l^l = Kn for constant k. 
The argument for rapid release of information from highly entangled black holes, however, requires 
starting from a mixed initial state and studying larger scrambling sets S of size n — 0(1) instead of 
just Kn. We have found logarithmic lower bounds on the scrambling time in both cases but not using 
identical reasoning. The pure state scenario, perhaps surprisingly, was more difficult to analyze. 

The understanding gained here should ultimately be helpful in properly formulating and evaluat- 
ing the scrambUng time of matrix quantum mechanics or other models of black holes. The correct ana- 
log of the simple decomposition into subsystems used here already poses a bit of a puzzle. Likewise, 
since some initial configurations are known not to scramble quickly, care is required in identifying the 
set of states that are rendered locally indistinguishable by the dynamics. The correct analog of "local 
information" should be physically well-motivated and basis-independent. The reward for resolving 
these issues will be great: a microscopic description of information leakage from black holes and, 
more generally, a deeper understanding of how nonlocal degrees of freedom in quantum gravity can 
be reconciled with the causal nature of semiclassical physics. 
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A Equations of motion for Brownian quantum circuits 

In this appendix we describe in detail the dynamics of the purity of the subsystem S as it evolves 
according to a Brownian quantum circuit. Our starting point is the equation of motion for ^'^(i). This 
can be found by tracing out the degrees of freedom in S*^ in (3.5): 



, ^ = "S^ tr5c 
v/8n(n - 1) ^ „ 



a 



+ 8n(n - 1) ^ i ^^S^{{'^?^'^k'^h{m)^(t){''?^'^k'®h{m))dt- (A.l) 



8n(.. 

j<k Oj ,Ofe=0 

The right hand side of this equation of motion consists of a noisy part 

3 



and a noiseless part 

1 ^ 

(tt) = -^gW^^+ 8^(n-i) E E ® C ® Ab>}) » C ® him)) dt 

(A.3) 



We'll deal with both of these terms in turn. First, the noisy part (f) can be reduced to 

3 



(t) 



V^nin - 1) . 



<fce5 a, ,Qfe=0 



+ y=riy E E_j-?'^5^(*)]^^".-(*)' (A- 



4) 



fees': 



where 



(A.5) 



and we have omitted tensor products with the identity to make the expressions more compact. The 
noiseless part (ff) can be rewritten as 

(tt) = -^sm + Yl ^'S^ (4 ^ dt, (A.6) 



j<k 
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which expands to a form that distinguishes different contributions: 

^ ' j<k£S 

+ E ® dt + l^'l(l-^'|-^) vi/g(t) dt. (A.7) 

nin — I) ^-^ ' n n — 1 

Reassembling the pieces yields the final equation of motion for ^s{t)- 

3 



j<keS oij,ak=0 
3 



^ ' j<kes ^ ' jes 

s^iiis^l - 1) 



, , ^s{t)dt. (A.8) 
n{n — 1) 



By another application of Ito's rule, the equation of motion for the purity hs{t) can be derived from 
the relation 

dhsit) = 2 tvi-if s{t)d^s{t)) + tri{d^s{t)f). (A.9) 

Because of the number of terms, it will be necessary to work with the equation of motion in pieces, 
as we did for "i>s{t): 

dhsit) = {*) + {**) + {***), (A.IO) 

where (*) and (**) are, respectively, the noisy and noiseless parts coming from the first term in (A.9), 
and (* * *) is the contribution of the second term. Firstly, (*) is given by 

3 

3 

' E E t<^s{t)[a;^ ^Is\{,},^fmdB^^,^,{t). (A.11) 

keS" 

There is no need to simplify this term any further because it will average to zero when we consider 
hs- The second term is more important for what follows: 

1 2|>S''^| 



^f^m(}S±ll_2]hs{t)dt. (A.12) 
' n(n — I) ' 
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Finally, (* * *) is just tr((d^'5(t))2): 



^ ' j<keSaj,ak=0 



which simplifies to 



E tr([af (A.13) 



, 2\S\(\S\ - 1), , , , 1 , 

After straightforward manipulations the expression further reduces to 



, 2\S\(\S\ - 1), , , , 1 V- , 



n{n — 1) 2n{n 



2\S 



Yy ^su{k}dt - ^s\{j}u{k}dt. (A.15) 



n(n J. , ,„ J- , 



Combining (*), (**) and (* * *) then averaging over the realizations of the Brownian motion yields 
the following system of coupled ODE's: 

dhsit) _ 2\S^\ ^ f 2\S^\i\S^\-l) 2|g|(|5|-l) \ 



2|S'| — 1 — 

^su{k} - [y ^s\{i}u{fc}- (A. 16) 



^ ' keS'= ^ ' j&s 

keS" 

B Solutions of the purity ODE system 

This appendix discusses solutions of the system of ODE's 

We have investigated these equations numerically with initial conditions hk = 1, and found a loga- 
rithmic behavior in the ratio of scrambling times r* = t^"'/tl ~ log n. Here, we will give a heuristic 
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Figure 6. Largest nonzero eigenvalue Ai(7i) and its corresponding eigenvector Ai{k,n) are computed nu- 
merically: a) the inverse of Ai with a negative sign is plotted as a function of n suggesting Ai ~ —f3/n with 
/3 ~ 2.99964 b) the eigenvector Ai{k, lO'^) is multiplied by 2*^ to shows the power law fc" with a ~ 1.346. 



analytical argument for this behavior. For small values of ^ and large n, the system in (B. 1) simplifies 
to _ 

^ = k {2hk-i + 2hk+i - hhk) , (B.2) 

where r = t/n. Define the tridiagonal matrix M by M^^k = — 5A; and Mk^k±i = 2k with k = 
0, . . . , n where the first row A; = is all zeros. Denote the eigenvalues of M by \j. The eigenvector 
corresponding to A = is simply E^^ = 2~^. The eigenvalue problem ME^-^^ = XE'^'^^ gives a set 
of recursive equations for E'^^^ which have solutions of the form 

Ei'^ = k2~^F,^k + l,^,2,iy (B.3) 

where 2F1 is the Gaussian hypergeometric function. These eigenvectors blow up in the Umit k ^ 00 
unless A = — 3j with j a positive integer. The general solution to (B.2) in the limit n — )■ 00 is given 
by 

00 

j=0 

= 2-^aQ + Y.ajk 2-SFi (k + l,l- j, 2, ^ e'^^'^ (B.4) 
i=i ^ ^ 

At late time, the largest contribution comes form the zero eigenfunction, which selects oq = 1. We 
can get a sense for the relaxation time by examining the eigenfunction corresponding to the second 
eigenvalue, namely the term with j = \. Direct evaluation of the hypergeometric function (which 
reduces to a polynomial in the above case) shows that the contribution of the j = I eigenvalue is 
proportional to 2^^fcaie^^^. Provided that the first correction qualitatively reflects the higher order 
corrections (which is does if aj decreases appropriately with j), we find ~ log k, so that r* ~ log n. 
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Next, we turn to a numerical study of the eigenvectors for subsystems of larger k/n. Similarly, 
the solutions will have the general form 

n 

7^fc(t) = 5^a,e^^(")%(A:,n), (B.5) 

where the Aj(n)'s are eigenvalues of the matrix B (and therefore /c-independent), and the Aj{k,n) 
are the corresponding eigenvectors. It is only the largest nonzero eigenvalue and eigenvector that are 
important for scrambling time. As can be seen in Figure 6, numerical results suggest that the largest 
nonzero eigenvalue Ai ~ — 3/n and its corresponding eigenvector Ai{k) ~ 2^^ k"' for a ~ 0(1)- 

C r-body interactions and the BFSS matrix model 

Here, we revisit the Lieb-Robinson argument presented above for systems with r-body nonlocal in- 
teractions. The Hamiltonian for such systems has the form H = J2x where the sum is over 
subsets of maximum size r and Hx acts on (>Sixex'Hx- We will restrict our analysis to systems where 
r is a constant, not a function of n. In analogy with the interaction graphs introduced in Section 5.2, 
here the system can be represented by a hypergraph. Motivated by the fast scrambling conjecture, we 
focus on the BFSS matrix model as an example of a Hamiltonian with multi-body interactions, but the 
same type of argument can be used for other systems with r-body interactions, including those with 
complete r-uniform hypergraph Hamiltonians. The bosonic part of the Hamiltonian has the form 

= ^ tr M^M" + tr[M", M*]^ 

a a,b 

= ^M^.Mj^ + 2 ^ (Mf^M%M'^,M^,-Mt,M^^MlX), (d) 

a,i,j a,b,i,j,k,l 

where the indices a and b range from 1 to 9 and the are n by n traceless Hermitian matrices. The 
degrees of freedom M? are indexed by triples (a, i, j) with i < j. The operators in the Hamiltonian 
have unbounded norm, so strictly speaking the Lieb-Robinson approach cannot be used. In this section 
we nonetheless proceed formally as if the operators had bounded norm in order to determine whether 
the counting is consistent with a logarithmic signalling time. 

The kinetic term M^^M^j in (C.l) is a single-body interaction, whereas the potential term is 
comprised of 4-body interactions of the form 

M^^M^.M^iM^, and M^^M^^mIiMI 

Repeating the same arguments as in the case of two-body interactions, for hypergraphs we find the 
inequality:-^ 

CB{A,t)<2 I dsCB{Z,s)\\Hzl (C.2) 

^The discussion here parallels Appendix A of [28] 
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where Z is any multiset^ of one or four degrees of freedom that has a nonzero contribution Hz to the 
Hamiltonian. Cb^Z, t) itself is bounded from above by 



CB{Z,t)<CB{Z,0) + 2 f dsCB{Z\s)\\Hz 

At t = the operators Oa in A and Osvay commute and therefore 



(C.3) 



Iterating the above inequality, one obtains 



2||Ob||, for ZBy 
0, otherwise. 



(C.4) 



CB{A,t) < 2\\0b\ 



\ 



(2i) Yl 11^^11 + 



Z:Zr\Aj^<ll,ZBy 



2! 



E 



l^zllll-f^z'll + 



V 



Z.Z': 
ZnZ'^fl),Z'3y 



The contribution of each degree of freedom (a, to the energy is bounded by 



Y \\Hz\\ < \\M-\\' + 8Y[MjM^kM^iM^,\\ + \\M^^^^^ 

Z5{a,i,j) 



(C.5) 



(C.6) 



Note that the potential part of the above energy bound has O(n^) terms. We require the kinetic and 
the potential parts to be separately finite in the limit n — )• oo. One way to satisfy this is to introduce 
the following constraints: 



\Hx\\ < 4 



yx : \x\ 



(C.7) 



for positive constants c and p. 

We are interested in finding an upper bound for the right hand side of (C.5). This requires count- 
ing the number of terms in the i*^ sum in (C.5). Figure 7 illustrates the type of subsets that correspond 
to the terms in the sum. Using the constraints in (C.7), the i*^^ term can be bounded from above by 



i-l 



E ii^'.ii-ii^f.iifiE 



E 



Zi,Z2---,Zi: 



ZinZi.ij^d,Zi3y 



k=0 Xi,X2---,Xi_k: 

Xi_fcnXi_fe_i^0,Xi_fe9?/ 



P \ 2 



c \« 



(C.8) 



where k is the number of single -body multisets among Zi , • • • , and 4-body multisets are denoted 
by X. (^,) counts the number of ways of choosing kofi multisets to have only one degree of freedom. 
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A Zi Z2 Z4 Z^ Zy 



Figure 7. The interaction hypergraph of the BFSS matrix model includes hyperedges that contain one or four 
vertices. The Lieb-Robinson bound in (C.5) is found by summing over a set of hyperedges that contain a path 
between y and A. This figure illustrates a typical path connecting y and A with seven hyperedges. 



Next, we focus on counting the number of terms in the sum on the right hand side of (C.8). Denote 
this number by Pi-k- If is the number of ways Xj can intersect ^j+i, then 

Pi-k < P{A,1) P{1,2) P(2,3) • • • P(i-k~l,i-k) ■ (C.9) 

Notice that each four-body interaction term M""^^ Mj^ M^f Mf^ in the Hamihonian has four indices 
k and / that run from 1 to n. Fixing one degree of freedom fixes two of these indices, while 
fixing a second degree of freedom leaves only one index. Therefore, is order if y ^ ^j+i 

and is order n if y G ^j+i- Since y has to belong to Xj for some j, there are a maximum of 
P = O nonzero terms in the sum (C.8). Plugging this result back in (C.8) gives 

Z\,Z2---,Zi: 

= ^ ((c + p)^-p*) +0(n-2) . (C.IO) 
for some positive constant c'. Now from (C.5) we find the inequality 

Cb{A, t) < "i^^ E ^ ((c + pY -p')+0 

i=\ 

= l^M^ (e2('=+f)* - e^P*) + O (n-2) . (C.ll) 

This finishes the "proof" that in the BFSS matrix model, signalling takes time at least tgignai > 
0(log n). Of course, we have really just proved the weaker statement that a logarithmic lower bound 
holds for a related system with bounded operators in its Hamiltonian. It is therefore conceivable 
that this proof could be adapted to hold for the real BFSS Hamiltonian for all states in a low energy 

''a multiset is a generalization of a set in which members can be repeated. 



" - n ^ k\{i - k)\ \cJ ^ ^ 

k=o ^ ' 
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subspace. Alternatively, Lieb-Robinson bounds for lattice systems have been proved for some Hamil- 
tonians containing unbounded operators [57]. Similar techniques might be applicable to the matrix 
model. 
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